On normalized compression distance and large malware
نویسندگان
چکیده
منابع مشابه
Normalized Compression Distance of Multiples
Normalized compression distance (NCD) is a parameter-free similarity measure based on compression. The NCD between pairs of objects is not sufficient for all applications. We propose an NCD of finite multisets (multiples) of objacts that is metric and is better for many applications. Previously, attempts to obtain such an NCD failed. We use the theoretical notion of Kolmogorov complexity that f...
متن کاملThe normalized compression distance and image distinguishability
We use an information-theoretic distortion measure called the Normalized Compression Distance (NCD), first proposed by M. Li et al., to determine whether two rectangular gray-scale images are visually distinguishable to a human observer. Image distinguishability is a fundamental constraint on operations carried out by all players in an image watermarking system. The NCD between two binary strin...
متن کاملNormalized Compression Distance for Gene Expression Analysis
In this paper we show that the normalized compression distance can be applied to gene expression data analysis. Typically, microarray-based classification involves using a feature subset selection method in connection with a specific distance metric. The performance is dependent on the selection of the methods. With our proposed approach there is no need for feature subset or distance metric se...
متن کاملEvaluating the Impact of Information Distortion on Normalized Compression Distance
In this paper we apply different techniques of information distortion on a set of classical books written in English. We study the impact that these distortions have upon the Kolmogorov complexity and the clustering by compression technique (the latter based on Normalized Compression Distance, NCD). We show how to decrease the complexity of the considered books introducing several modifications...
متن کاملEffect of Image Linearization on Normalized Compression Distance
Normalized Information Distance, based on Kolmogorov complexity, is an emerging metric for image similarity. It is approximated by the Normalized Compression Distance (NCD) which generates the relative distance between two strings by using standard compression algorithms to compare linear strings of information. This relative distance quantifies the degree of similarity between the two objects....
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Journal of Computer Virology and Hacking Techniques
سال: 2015
ISSN: 2263-8733
DOI: 10.1007/s11416-015-0260-0